On the probabilistic formulation of the replica approach 

to spin glasses 

Giorgio Parisi 
Dipartimento di Fisica, Universita La Sapienza 
INFN Sezione di Roma I 
Piazzale Aldo Moro 2 , Roma 00185, Italy 

February 1, 2008 

Abstract 

In this paper we review the predictions of the replica approach on the probability 
distribution of the overlaps among replicas and on the sample to sample fluctuations of 
this probability. We stress the role of replica equivalence in obtaining relations which 
do not depend on the form of replica symmetry breaking. A comparison is done with 
the results obtained with a different rigorous approach. The role of the ultrametricity 
and of other algebraic properties in discussed. It is shown that the ultrametric solution 
can be obtained from a variational principle. 

1 Introduction 

The mean field theory for spin glasses has been originally derived by finding the solution of 
the Sherrington Kirkpatrick model [[l], § . The replica method was used in the first derivation. 
Probabilistic tools (based the cavity method) can be used instead of the replica approach, 
the formalism being more esplicite at the price of being more heavy |l|, [2|. |3|] . 

The probabilistic method and the replica approach are conceptually equivalent. In both 
cases the ultrametric structure of the states was assumed from the beginning. In order to 
present a definitive proof of the proposed solution, one should prove that the ultrametric 
Ansatz is the unique stable solution of the mean field equations, or, if there are more stable 
solutions, one should show that the ultrametric one has be chosen. This proof is lacking 
because of the difficulties in considering a generic non ultrametric Ansatz. 

The aim of this note is to perform a small step in this direction by combining probabilistic 
methods and the replica techniques and by showing that the usual ultrametric solution can 
be selected by using a variational principle. In section II and III we briefly review to replica 
technique and the meaning of spontaneously broken replica symmetry. In section IV we 
spell out some of the consequences of replica equivalence and its connections with rigorous 
results. In section V we study ultrametricity and in section VI we show how the ultrametric 
solution may be singled out using an appropriate variational principle. In section VII we 
mention the existence of two extra symmetry, translational invariance in replica space and 
separability, which strongly constrain the probability distribution. 
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2 The real replica approach 



In this paper many of the considerations will not depend on the detailed form of the interac- 
tion. We will study a system whose dynamical variables are Ising spins (cr, = ±1, i — 1, N). 
The total number of degrees of freedom is iV which is assumed to be quite large but finite. 
The system is at thermal equilibrium at a temperature T = (3~ l . 

The Hamiltonian will depend on some control variables J. A standard procedure consists 
in considering an ensemble of systems, which differ by the value of the control parameters 
J and to compute the average over J [|]]. In many cases, i.e. spin glasses, the number of 
parameters J is proportional to N; an other possibility consists in considering a fixed value 
of J and averaging over N inside a given interval (e.g. Nq — A<N<Nq + A with A large 
||). In the following the average over the J variables (or over N) will be denoted by a bar. 

Let us firstly study what happens at a given value of J (or N). It is useful to consider m 
replicas of the same system (m being arbitrary, may be infinite) and to define the overlaps 

q,,t = , (1) 

where s and t denote the replica. 

Here the number of replicas m is a positive integer, which remain fixed and will not go to 
zero at the end of the computation; this number should not be confused with n, the number 
of replicas that are used to compute the statistical expectation values, which eventually 
is taken to be equal to zero. These replicas are just m copies of the same system; their 
properties are in principle experimentally observables (they can be studied with numerical 
simulations) and some times they are called real replicas || . 

We want to explore the possibility that the quantity q still fluctuates also for very large 
(obviously finite) systems. For each choice of the parameters J we can study the probability 
distributions of the overlaps. We can define various functions, the simplest one Pj(q) defined 

/ dqPj(q)f(q) =< f(q 1;2 ) >= < f(q s , t ) >j. (2) 

where s and t are arbitrary different replicas (all replicas are intrinsically equal!). In other 
words Pj{q) is the probability that two different equilibrium configurations have overlap q. 
In we consider more overlaps together, we have to introduce other probability distribu- 
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tions: e.g. Pj ' ' (q 1>2 , g 2j3 , <?3,i) defined as: 

J dq lj2 dq 2i3 dq 3il P} 2t23 ' 31 (q h2 , 92,3, 93,0/(91,2, 92,3, <?3,i) =< /(<7i,2, 92,3, £3,1) >J • (3) 

There are other probability distributions which can be trivially obtained as functions of 
others. An example is given by pj 2,34 (g 12 , ^3,4) defined as 

J dq li2 dq 3A P] 2 ' U (q h2 ,q3A)f(qi,2,q3A) =< /(<?i,2, <?3,4) >j • (4) 

It is obvious that all the configurations a? have the same probability independently from the 
value of s, so that the probability distribution of the overlaps will be symmetric under the 
exchange of the replica indices. We thus have 

^(91,2) = ^(93,4), P} 2 ' 34 (qi,2, g 3 , 4 ) = P.Mi,2)PM A ). (5) 

By increasing the number of replicas we find that more and more probability functions 
are needed. Each system (for given J and N) may have the functions P which differs from 



those of an other system. Indeed at low temperature, where these functions are not given by 
a single 5 function, they change of a quantity of order 1 when the total Hamiltonian changes 
of a quantity of order 1 (e.g. when we change the form of the Hamiltonian in one point). 

The actual form of the functions P for a given systems has the disappointing property 
of not being thermodynamical stable, i.e. it changes completely under any perturbation. As 
we shall see later, this is the price to pay in order to have that the probability distribution 
of the functions P. i.e. V[P], is stable with respect to a wide class of perturbations f7|, Kl- 
in order to get a feeling of the meaning of the fluctuations of the overlap, let us suppose 
that the Boltzmann average for finite but large N can be decomposed into different pure 
states or valleys: 

< • >=E W « < • >a (6) 

a 

and the expectation value in each of the pure state a satisfy the cluster property for the 
correlations functions, i.e. intensive quantities do not fluctuate. 

The name pure state is abusive from a strict point of view: intensive quantities do always 
have fluctuations of order iV _1 and for N fixed (albeit large but finite) we do not have any 
criterion (apart common sense) to distinguish fluctuations which are of order 1 from those 
that are of order jV -1 . 

In the usual ordered case one can easily define the states for the actual infinite system. 
In disordered systems the limit N — > oo may be rather tricky and it is quite possible that 
the expectation values of local observables do not have a limit (i.e. they oscillate) when 
N — > oo. Here we do not want to enter into rather sophisticated mathematical arguments 
and we limit ourselves to study finite and large systems and to perform the limit iV — ► oo 
only after the average over J (or N). 

If we assume that equation (|6]) can be approximately written for also for finite system, 
we can approximately write (using the asymptotic vanishing of the connected correlations 
within a pure state): 

pAq) = Y. w ^ w i 5 ^ an -?)> 

Pj 2 ' 23 ' 31 (Qi,2, g2,3?3,i) = Yl w aW y Wp5(q ail - qi, 2 )S(q a ,p - g 2l 3)<%/3 l7 - <?3,l)- ( 7 ) 

In general these probabilities are J-dependent. It is interesting to consider their average 
over the J's, which for simplicity we will indicate removing the suffix J: 

P{qi,2) = PAQI2), P 12 ' 23 ' 3 \qi,2, g 2> 3, g 3 ,i) = Pj 2 ' 23 ' 3 \qi,2,q2,3~^i) , 

P 12 ' 34 (gi, 2 ,g 3i 4) = Pf 34 (gi, 2 ,g3,4). (8) 

The number of independent probability distributions is infinite. The replica method of 
next section consists in coding all this infinite set of functions into a n x n matrix Q. The 
value of n is originally an integer. At a later stage non integer values of n are allowed and 
eventually the limit n —>■ is performed. 

3 Breaking the replica symmetry 

The replica method uses an x n matrix Q. The free energy can be written as function of Q 
in the limit n — > and is invariant under the action of the permutation group of n elements 
acting on indices of Q. In this section we will study the relations among the properties of 
the matrix Q and the probability distributions introduced in the previous section. 



When replica symmetry is exact all the off diagonal elements of the matrix Q are equal 
(the diagonal elements of Q are equal to zero). When replica symmetry is spontaneously 
broken, we must consider a matrix Q which is not invariant under the permutations of 
replicas. If II is a permutation and 11(a) is the value of the index a after a permutation, the 
value of 

Qu(a),U(b) = Qa,bi ( 9 ) 

for fixed a and 6, depends on the permutation II. We can thus introduce the function P R (qi, 2 ) 
which is the probability distribution of the quantity Q^ 2 averaged over all n\ permutations 
IT with equal weight. In other terms we define the function Pr(oi 2) as follows; 



En/(Q? 
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J dq h2 P R (q h2 )f(q h2 ), (10) 
/ being an arbitrary test function. In the same way we can define other functions, e.g. 

^7 — = J dq lt2 dq 3A P R ' (gi, 2 , 93,4)/ (91,2, 93,4)- (11) 

For each matrix Q for a given n we define a set of probability functions Pr, which are in one 
to one correspondence with the previous introduced functions [J 

The main assumption of the replica method is that there exists a matrix Q such that for 
a given value of n (in particular for n — 0) the Pr functions defined in this way do coincide 
with the homologous probabilities P previously defined with the real replicas. The case 
n = seems rather strange; it is defined by considering the previous formulae for integer n 
and perform the analytic continuation to n — 0. At the present moment it is not clear how 
much restrictive this hypothesis is. The space of all possible matrix Q analytically continued 
at n = is infinite dimensional and it is not easy to prove that a given set of P functions 
cannot be obtained in this way. 

In other words we have traded an infinite matrix q a ^ (if m = 00), which fluctuates also 
for very large systems, with a matrix n x n matrix Q (which does not fluctuate) in the limit 
n — > 0. We have coded the infinite number of functions, which characterize the probability 
distribution of q, in the structure of the matrix Q. 

As we said in the introduction the only well studied example of matrix Q is such that the 
probability distribution of the overlaps is ultrametric, i.e. if qi t2 > qi^ and P(qi, 2 , q 2 ,3, 03,1) > 
then g 2 ,3 = <?i,3- I n this ultrametric scheme the relevant quantity is P(q\ y2 ) and (as we shall 
see later) all other high order functions can be obtained from it by algebraic manipulations 

Bil- 



4 Replica Equivalence 

4.1 Introducing Replica Equivalence 

Let us now consider a more general case in which ultrametricity is not assumed. There are 
general arguments which imply that the form of the matrix Q is not completely arbitrary 
It has been argued that also when replica symmetry is broken the matrix Q must be 

1 If we are not interested to consider the overlap of a configuration with itself, that for Ising spin is 1, we 
can consider only the off diagonal terms of the matrix Q, which it is conventionally taken to be zero on the 
diagonal. 



1 



such that no replica has different characteristics from the others. Observables which involve 
only one replica must be replica symmetric. In particular we must have that 

£/(<M (12) 

b 

does not depend on a Q. This properties is called replica equivalence. It is likely connected 
to the fact that the expectation values of intensive (J- independent) quantities do not change 
from one state to the other. 

This hypothesis implies automatically in the replica formalism that the free energy is 
finite. Indeed in the replica approach the free energy can be written as function of the 
matrix Q (see eq. (^). If replica equivalence is correct, the term 1/n in the free energy 
cancels automatically with the sum over n. Moreover adding the appropriate terms in the 
Hamiltonian we can generate an extra term proportional to 

J2a,bf{Qa,b) (13) 

n 

This term is automatically finite if condition (|T2"D is satisfied. 

It interesting to note that this argument is quite similar to those which has been used 
by Guerra [[/J in his rigourous derivation of some of the following equations. The replica 
equivalence conditions is also deeply related to the stability conditions of Aizenman and 
Contucci [§]. 



4.2 Two overlaps 



Condition (12 ) has interesting consequences. In the simplest case we can consider the fol- 



lowing equations: 

E = (E Qtf = J dqiPiqM 1 J dq 2 P{q 2 )q\\ (14) 



c,d 



The previous equation holds both for a = b and for a ^ b. Let us spell its consequences 
in the first case. Its l.h.s can be written as 

E +EM {Q&Qtb)- (is) 

^c,d;cj^d c,d J 

In the first sum there are (n — 2)(n — 1) non zero terms (Q a , a = 0) and in the second sum 
there are (n — 1) non zero terms. Each term of the first and second sum give a contribution 
equal respectively to 

/ dq 1>2 dq 1 , 3 P 12 ' 13 (q 1 , 2 ,q 1>3 )q k 1 ^% , J dq^P{q^)q k ^ . (16) 

Putting everything together we obtain the relations 

P 12 ' 13 (gi, 2 , <Zi >3 ) = ^(91,2)^(91,3) + ~P(<2i, 2 )<%i,2 - <7i, 3 ). (17) 

A similar equation can be obtained if we consider the case a 7^ b. Indeed after a similar 
algebra and using the previous results we get 

2 This is essentially equivalent to assume that each line is a permutation of other lines, which would be 
true for example in the case of Knuth's combinatorial matrices . 
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P 12 ' 34 (gi, 2 , g 3 ,4) = ^(91,2)^(93,4) + ^(91,2)^(91,2 - 93,4). (18) 

These relations have a peculiar standing because they have been proved by Guerra [|7| 
under quite general conditions. We can thus safely assume that they are correct, without 
any further reference to the replica method. 



4.3 Three overlaps 

Using the same method as before we obtain that replica equivalence implies the following 
relations: 

6 pl2,13,14 (?i)g2j?3) = p( 9l )p( g2 ) P ( g3 ) + 

5(qi - 92)^(91)^(93) + 5(9i - 93)^(91)^(92) + 5(92 - 93)^(91)^(92) + (19) 

25(q 1 - 92)5(92 - 93)^(91) • 

6 pl2,13,34 (gi) gs) = 2 pl2,23,31 (?i) ^ ^ + pfajpfapfa) + 

5(9i - <h)P(qi)P{<h) + 5(92 - 93)^(91)^(92) + 6{ qi - q 2 )S(q 2 - q 3 )P{ qi ) , 

12 pl2,34,15 (gi; ?3) = 2 pl2,23,31 (gi) fe ft) + 3p ((?1 )p (?2 )p( g3 ) + 

5(9i - 9 2 )P(9i)P(9 3 ) + 35( 9l - 93)^(91)^(92) + 5(g 2 - 93)^(91)^(92) + (20) 

25(9i - 92)5(92 - 9s)P(9i) , (21) 

15P 12 ' 34 ' 56 ( gi ,g 2 ,g 3 ) = 2P 12 < 23 < 31 ( gi ,g 2 ,g 3 ) + 5P( gi )P(q 2 )P(q 3 ) + 
+2%i - <h)P{qi)P{q 3 ) + 26{ Ql - q 3 )P{qi)P{<h) + 25(g 2 - 93)^(91)^(92) + (22) 

25(gi - 9 2 )5(g 2 - g 3 )-P(?i)- 

Some of these relations have been derived in || using a different technique. 

It is remarkable that all the probability functions involving three overlaps can be com- 
puted from that with one overlap plus p 12 > 23 > 31 . This last function if quite important because 
it contains the main information on the validity of ultrametricity. 



4.4 Further consequences 

If we look in more details to replica equivalence there are further relations that must be 
satisfied. For example we could impose that 

T a = £ i>tq k c % (23) 

6,c 

does not depend on a. 

For example we could spell the consequences of the condition 

T T 2 V T 

£-~ia a / ° \2 (24) 

n n 

If we proceed as before we find that 

6 pl2,23,31,14,45,51 (gij &> ^ ^ fo ^ + pW^ ^ _ _ _ ^ = 

P 12 ' 23 ' 31 (9i, 92, 93)P M ' 45 ' 51 (94, 95, 9e) + (25) 

+3( pl2, W(?l) g3) gg)5(g4 _ fe) + pl2,23,31,14,43 (gi; ^ ^ ^ _ gg) 

Many relations of this kind may be derived and it is quite likely that all the equation that 
can be derived from replica equivalence can also proved using the method of [0, §]. 



5 Ultrametricity 



The ultrametricity condition states that the function p 12 > 23 > 31 can be written as 

P 12 ' 23 ' 31 (gi2,te,gi 3 ) = 

%i2 - gi3)<%23 _ Qi3)A(qi2,qi3) + 2 cyclic permutations (26) 

+B(q 12 )5(qi2 - gi3)%23 - 913) 



It is already been shown JTSI that this condition and the previous relations, in particular 
eq. fllTD , implies that 

^(g 12 ,gi,3) = P(ii2)P(qi,3), 

B(q 12 ) = x{q 12 )P{q 12 ) (27) 

x(q)= f 9 dq'P( q '), (28) 
Jo 

and we have assumed for simplicity that the support of the function P(q) is within the 
interval [0-1]. 

This relation has been proved |T3[ by noticing that 



where 



fdq 
Jo 



23 P 12 ' 23 ' 31 (gi 2 ,g 2 3,gi3) = P 12 ' 13 (gi 2 , gi 3 ). (29) 
The r.h.s is given by eq. fll7[) while the l.h.s. is given by 

-4(912, qn) + J qi2 l dq'{A(qi2, q')5(q 12 - 913) + B(q 12 )5(q 12 - qi 3 ))- (30) 



We finally find that the usual result (|27|) is correct. 

In the same way it can be proved that in the ultrametric case also the other probability 
distribution involving more overlaps e.g. P 12,23,34,41 (g 12 , g 23 , g 34 , g 41 ) are uniquely determined 
in terms of the only independent function P(q). 

The argument is quite similar to the previous one. Ultrametricity implies that 

P 12 ' wl (gi 2 ,g 23 ,g 3 4,g4i)= (31) 

%i2 - <?i3)%23 - <?34)%34 - qu)A(q 12 , q 2 3, ^4) + permutations + delta funtions. (32) 
On the other hand 

J dg 4 iP 12 ' 23 ' 34 ' 41 (gi 2 , g 23 , g 34 , to) = P 12 ' 23 ' 34 (gi 2 , <? 23 , g 34 ) (33) 

and the r.h.s is known from replica equivalence. 

Generally speaking ultrametricity implies that the probability distribution of the overlaps 
of m replicas, which is a priori a function of m(m — l)/2 variables, depends only onra-1 
variables and therefore, using the relations coming from the replica equivalence, is determined 
by the probability distribution with a smaller number of replicas. 

Given the function P(q) there is one and only one ultrametric set of probabilities satisfy- 
ing the principle of replica equivalence. We shall see in the next section how the ultrametric 
distribution may be characterized using a variational principle. 



6 A variational approach 



6.1 The variational equations 

We now apply this formalism to the study of the infinite range Sherrington Kirkpatrick 
model near T c . 

The model is defined as follows: 

H = - Ji,k<7i&k, (34) 

^ i,k=l,N 

where the J's are random Gaussian variables with zero average and variance iV -1 / 2 . 

In the replica approach the free energy density can be obtaining by finding the minimum 
with respect to the matrix Q the function F(Q) defined as: 

F(Q) = ^TrQ 2 - -ln(]>>xp(]rQ aife a a a 6 )). (35) 

^ ^ cr a a,b 

The definition of minimum is rather tricky: we say that F(Q) has a minimum if its Hessian 

Hab ' cd = di^o7 d (36) 

has non negative eigenvalues. This condition may result as a minimum or as a maximum 
condition as function of the parameters on which the matrix Q depends. 

Near the critical temperature, where Q is small, F(Q) can be approximated (neglecting 
terms which do not play a crucial role) as: 

f(Q) = ^(-t^W 2 ) + l^(Q 3 ) -\Y,Q*) ■ (37) 

ab 

Let us concentrate our attention on the function F(Q). Its value is given by 

W[V\ = J dq 12 P(q 12 )C-ql 2 + |^ 2 ) ~\ J dq 12 dq 13 dq 23 P 12 ' 23 ' 31 {q 12 , q 13 , q 23 )q l2 qnq23, (38) 

where we indicate with V the whole set of P functions, which can be computed in terms of 
the matrix Q. Near T c , where we can approximate the F(Q) with a polynomial, only a finite 
number of P functions are relevant. 

Now there is a apparently strange phenomenon. We have remarked that the matrix Q is 
a compact form for coding the the probability functional V and it does not contains extra 
information. We would thus expect that the equations we get from the condition 

dW 

= (39) 



dQ a 

are the same that those obtained from the condition 

5W 



0. (40) 



6V 

This is not the case. Equation (^) implies that 

TQa,b + VQlb = E Qa,cQc,b- (41) 



This equation correspond to an infinite set of equations for the probability distribution. For 
example if we multiply it by Q* b for arbitrary k we arrive to the equation 



P(q)(rq + yq 3 ) = J dsdtP 12 ' 23 > 31 (q, s, t)st (42) 

If we square it and we multiply again by Q k a b for arbitrary k we arrive to the equation 

P(q)(rq + yq 3 ) 2 = J dsdtP 12 ' 23 ' 31 (q, s, t)s 2 t 2 
+ j dsdtdudvP 12 > 23 > 31 > 2AA1 (q,s,t,u,v)stuv. (43) 

On the other hand the equation (|40|) depends only on the probabilities with one or three 
overlaps. Different forms of the matrix Q which gives the same probability distributions 
with one and three overlaps, will give the same value of the free energy. 

In the usual approach we do not see any difference among equations (0) and fl3"9f ). The 
key point is separability (discussed in the next section) which is satisfied in the usual ap- 
proach. Separability implies that the equation ( fi~3|) is a consequence of equation ([42"f) . In the 
usual approach the construction of the matrix Q has been done using symmetry principles. 
The matrix Q depends on a sets of parameters q and the equation (|39D is equivalent to 



dF 

*- a < 44 > 

If we work from a general point of view, the two equations looks rather different. It is clear 
that (^) contains more information than (|4"0| ) if we stay in case in which the separability 
conditions does not hold, however the last one should be enough to compute the free energy 
in any case. 

Let us try to spell out the consequences of eq. ( f4~0l) . It is clear that we cannot change all 
the P in an independent way. 

• The probability with more than one overlaps are obviously related to that with less 
overlap. 

• We have the constraint that the probabilities are not negative. 

• On top of all these requirements Guerra's relations should be satisfied. 

We propose the following variational principle which should equivalent the the usual one: the 
free energy and the correct probability distributions are obtained by finding the maximum of 
the functional W[P] with respect to the function P(q) and finding the minimum with respect 
to the other function p 12 > 23 > 31 . This principle is applied in the region where all the previous 
constraints are satisfied. In other words we suppose the probability are non negative and 
that 



/ ^i, 2 P 12 ' 23 ' 31 (gi,2,gi, 3 ,g 2 ,3) = P 12 ' 23 (gi, 2 ,g 2 , 3 ) = -P(?i, 2 )P(gi )3 ) + -P(<?i, 2 )%i, 2 - ?i, 3 ) 

(45) 

The equation (f45|) is extremely important because it tell us we cannot change the function 
P 12 ' 23 ' 31 {qi,2, $1,3,12,3) in an arbitrary way. 

It may looks strange to maximize with respect to one parameter and to minimize with 
respect to an other, but this is quite a common situation in the replica approach [|TJ. 



n 



We will argue now that this variational principle gives the usual ultrametric solution. 
It is clear that if ultrametricity is assumed, we recover the older approach, where the free 
energy is maximized with respect to P(q). The only think we have to prove is that the 
minimization with respect to p 12 ' 23 ' 31 at fixed P(q) implies ultrametricity. In order to see 
how it could be done let us consider two simple case. 



6.2 An example: the overlap has two possible values 

In the first case the overlap can take two values go with probability po and q± with probability 
Pi (we suppose go < gi)- For the three overlaps we have four possibilities (all three are 
equal to gi, two to gi and one to go . . . ) and the corresponding probabilities are denoted 
Pm,Piio,Pioo,Pooo with obvious notation. The free energy is thus 

i~ y t y 1 

Poi^Vo + ^o) + Pi^Si + 4?i) ~~ gCPmSi + 3 Pno9i9o + ^Piooqiqo + Pooo9o) ( 46 ) 

We have that: 

Pm + Pno = Pn = ^(Pi+ Pi), 

Pno + Pioo = Pw = ^PiPo, (47) 

Pwo + Pooo = Pw = ^(Po + Po)- 

The second equalities of each line follow from Guerra's relations. The previous relations can 
be written as 

Pm = ^(pI+Pi) -u, 

Pno = u, (48) 

1 

Pioo = -jPiPa ~ u, 
Pooo = pI~u. 

Let us concentrate our attention on the dependence of the free energy on u. We find that 

F(u)=F(0) + u( qi -q ) 3 . (49) 

The coefficient of u is positive and the minimum of the free energy with respect to u is 
located at the minimum value of u, i.e. u — 0. It is crucial to note that u = is just 
the condition which follows from the ultrametricity condition. Using the result u — the 
remaining parameters and be computed as usual. 



6.3 An other example: the overlap has three possible values 

The same phenomenon happens if we consider a more complicated situation: three of more 
values of the overlap. Let us study what happens for three possible values of the overlap. 
After some algebra one finds 

P222 = ^{P2 + p\) - a - b, P221 = a, p 2 2o = b, 



i n 



P2H = ^PiP2 -a-c, P210 = c, P200 = ^P2Po - b - c, (50) 
pin = -(2p! - 2pip 2 - P0P1) + a + c-d, p no = d, 
P100 = ^PiPo - c. pooo = Po ~ d + & + 2c ' 
One finds finally the free energy depends on a, b, c and d as follows 
F(a, b, c, d) = F(0, 0, 0, 0)+a(g 2 -gi) 3 +% 2 -go) 3 +rf(gi-go) 3 +c(gi-go) 2 (3g 2 -gi-2go)- (51) 

All the coefficient are positive definite in the region where go < Qi < <?2- The minimum of the 
free energy is reached ata = b = c = d = 0, and the ultrametricity conditions are satisfied. 

In general the distribution probability of three replicas can be written as a form which 
satisfies the ultrametricity relations plus a reminder. The dependence of the free energy 
on the reminder is linear and therefore its minimum is located just at the boundary of the 
allowed region. The sign structure is crucial to obtain that the boundary which correspond to 
a minimum is located at the point at which the ultrametricity violations have the minimum 
possible value (i.e. zero). Once that the ultrametricity structure is recovered, the expression 
for the free energy is the usual one. 

We are thus in the strange situation: in order to obtain the ultrametric solution we must 
maximize the free energy with respect to the function P(q), but minimize it with respect to 
the other parameters which describe the violations of the ultrametricity. 

On the other hand, the other possibility, the one which correspond to maximize every- 
thing, produces a maximal violation of the ultrametricity which would give rather nonsensical 
results. Indeed in the first case we would get pioo = and puo > 0, which would clash with 
the interpretation in terms of states. If q^ — Qi 1 an d 2 belongs to the same state and if 
<7i,3 =9i 1 and 3 belongs to the same; by transitivity 1 and 3 must belong to the same state 
and consequently q i3 = q\. 

It is also not clear if the non ultrametric solution does have a replica interpretation, i.e. 
there is an associated matrix Q. 

It is likely worthwhile to look again to the equations coming from the cavity approach 
using as far a possible a general approach in which the ultrametricity is not assumed in 
order to see if there exists a non-ultrametric solution. Equation (|40|) or equivalently its 



consequences eqs. ([|1],[I2|) should be derivable in the framework of the cavity approach. It 
may be possible that subleading terms must be considered in order to obtain interesting 
results. 



7 Other algebraic structures 

We have seen in the previous section that ultrametricity is an extremely powerful constraint 
and that there is only one ultrametric probability distribution at fixed (P(q)). In this section 
we explore other algebraic properties that are present in the usual solution. The aim is to 
characterize as far as possible the probability distribution in order to find how much space 
there is for non ultrametric solutions, which should still have good properties. 

The properties which we are going to consider are translational invariance in replica space 
and separability. 
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7.1 Translational invariance 



Replica equivalence implies that all replicas are equivalent. The simplest way to implement 
it is to assume that there a group G (a subgroup of the group of permutation of n elements) 
which leaves the matrix Q invariant and acts transitively on the space of indices (i.e. for 
any pairs a, b there is element g of the group such that g ■ a = b) fp70| . 

The simplest group we can think of, as suggested by Kondor |T]J, is the translational 
group, i.e. the group generated by the operation a — > a + 1 (all sums and differences are 
done modulo n, in this section). It was later realized by Sourlas ]12|, [U| that by reshuffling 



the indices the usual form of matrix Q in presence of replica symmetry breaking can be put 
in such a way to be translational invariant. In this new base we have 

Q a ,b = q(a-b). (52) 

Translational invariance is quite useful at the technical level: the matrix product becomes 



now a convolution which can be easily done by Fourier transform JTJ, |TJJ]. Using this fact, if 
we call s the variable of the Fourier transform, one finds that there exists a function F(q, s) 
such that 

J dfi(s)F(qi, s)F(q 2 , s) = P{q x )8{q 1 - q 2 ), 
J dn(s)F( qi , s)F(q 2 , s)F(q 3 , s) = P 12 ^ 31 (q u q 2 , q 3 ). (53) 

Similar relations can be found for probabilities involving an high number of q (i.e for 
pi2,23,34,4i^ pi2,23,34,45,5i an( j g0 on These relations are not on the same standing of the ones 
derived in the previous sections because they do not allow us to write algebraic equations 
involving a finite set of probability distributions; they however strongly constrain the set of 
allowed probability distributions. 

At this stage it is not clear how to exploit the consequences of translational invariance 
on the probability distribution. It seems reasonable to assume that not all the probability 
distributions which satisfy replica equivalence can be generated by a matrix Q a b which is 
translational invariant, but it is no so easy to find a counterexample. It is not clear if the 



alternative scheme introduced in |T0[ can be extended in such a way to have a probability 
interpretation. 

An answer to these questions would be interesting, because it would help us to elucidate 
the origine of the rather mysterious translation invariance in replica space. 



7.2 Separability 

Separability (or non degeneracy) correspond to the following algebraic statement. Let us 
consider all the matrices which can be generated from the matrix Q in a permutational 
covariant fashion. Some example are 

Qab' QacQcb, Q acQ adQ cdQ cbQ db- (54) 

c c ,d 

Separability states that if we take two pair of indices (ab and cd), we have that 

Qab = Qcd M ab = M cd (55) 

where M is a generic matrix of the set generated by the rules (|54]) . In other words pairs of 
indices which have different properties have a different values of Q. 
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In the usual approach when replica symmetry is broken there is subgroup of the group 
of permutations that commutes with the matrix Q. Let us consider the orbits in the space 
of pairs of indices. It evident that the values of the elements of the matrix Q and of any 
matrix derived using the rules are constant of the orbits. If we assume that different orbits 
have different value of the matrix Q we obtain the condition fl55[) . 

The separability condition is extremely powerful in determining the expectation values 
of higher order moments of the probability distribution. 

Let us study a simple example and let us consider a matrix M constructed with the rules 
(|54T). It is evident that if 



b 

we have that 



J2Qa,bM a ,b = J dqP{q)M{q)q k ) ^Q^Raj, = J dqP(q)R(q)q k (56) 



J2Q k a, b M a ,bRa,b = J dq P (g) M (g) R(q) q k . (57) 



b 

If we apply the previous formula to the case where M and R have the form 

Mab = E QacQcl Rab = £ (58) 

c c 

we find the rather surprising formula 

3^2,13,32,14,42^ ^ fe fe ft) = _ _ ^12,23,31 (g> ^ &) 

i0 P 12 ^ 31 ( g , gi , g2 )P 12 ^(g, g3 ,94) 
+2 ^ • ( 59 ) 

Similar results can be obtained for other probability distributions. 

Equation ([59]) is particular interesting because integrating over q it implies that 



3 pl3,32,14,42 (gi; ?4) = _ ga)5(g2 _ ( P ( gi )p( ?2 ) + J( gi _ g 2 )P(g 2 )) (60) 

P 12 ' 23 ' 31 (g,gi,g 2 )P 12 ' 23 ' 31 (g,g3,g 4 ) 



+2 y„ (61) 

This equation can also written in more suggestive form as 

3 pl3,32,14,42 (gi) g4) = 6{gi _ q3)6{g2 _ qi) ( P ( gi )p( ?2 ) + _ g 2 )P(g 2 )) (62) 

+2 / rfgP 12 ' 23 ' 31 (g, g 1? g 2 )P 23 ' 31 ' 12 (g 3 , ft|g), (63) 

where p 23 > 31 > 12 (g 3; g 4 | g) is a conditional probability. 

Now the l.h.s of eq. (0)is by definition invariant under cyclic permutation of the g's, while 
the r.h.s in not invariant for a generic choice of the function p 12 > 23 > 31 . A boring computation 
is needed to verify that if we insert the ultrametric result of the previous section, we find a 
symmetric result (as we should) for p 13 ' 32 . 14 - 42 . 

It not clear which is the most general form of the of the probability p 12 < 23 < 31 ; which gives 
is compatible with separability. We have checked what happens in the case of two or three 
possible values of the overlaps, discussed in the previous section. 

• In the case of two possible values of the overlap, the only non trivial relation is 

P 13 ' 32 ' 14 ' 42 (g , gi, go, a) = P 13 ' 32 < 14 ' 42 (gi, Ql , go, go). (64) 

An explicit computation shows that this relation is identically satisfied. 
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• In the case of two possible values of the same relation would is not identically satisfied, 
however it is satisfied for a = b = c with arbitrary d. 

The previous examples shows that separability does not imply ultrametricity in the gen- 
eral case, but impose rather strong constraints on the distributions functions P. 

This relations coming from separability are quite powerful in imposing further constraint 
on the probability distribution beyond replica equivalence. Their origine can be well under- 
stood in an algebraic framework, however it is not clear if they can be derived in a more 
general setting. It would be extremely interesting to check if they are satisfied in numerical 
simulations. 
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